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Abstract 


In this technical report, we present the results of a reliability study of the fourth-grade 
multiple choice reading comprehension measures available on the easyCBM learning 
system conducted in the spring of 20 1 1 . Analyses include split-half reliability, alternate 
form reliability, person and item reliability as derived from Rasch analysis, top / bottom 
reliability, and repeated measures analysis of variance (ANOVA). Results suggest 
adequate reliability for the fourth-grade multiple choice reading comprehension 
measures. 



Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 4 

Curriculum-based measures (CBMs) are standardized assessments with a rich history of 
use for screening students at risk for difficulty in reading as well as for tracking the progress 
students make in gaining essential skills over the course of a school year (Alonzo, Ketterlin- 
Geller, & Tindal, 2006). In recent years, the widespread adoption of Response to Intervention 
(RTI) as a model for instructional delivery and school-wide improvement efforts has resulted in 
renewed attention being given to CBMs and a greater emphasis being placed on their technical 
adequacy for a variety of uses. One concern expressed by practitioners and researchers alike is 
the degree to which the brief, individually-administered fluency-based probes most frequently 
identified with CBM are appropriate for use with older students. As students move from 
elementary to secondary school, there is some evidence to suggest that more complex CBMs, 
designed to measure more challenging constructs, such as reading comprehension and 
vocabulary in context may be more appropriate (Yovanoff, Duesbery, Alonzo, & Tindal, 2005). 
In this technical report, we describe a study of the reliability of the easyCBM fourth-grade 
multiple choice reading comprehension measures conducted in 201 1. 

Methods 

In this section, we describe the methods used in conducting a study of the split-half and 
top-bottom reliability, as well as Rasch analyses of grade 4 multiple-choice reading 
comprehension (MCRC) measures from the easyCBM 1 * assessment system. 

Setting and Participants 

The study was conducted in elementary and middle schools from two Pacific Northwest 
public school districts in the spring of 201 1. District A was diverse, and comprised of 
approximately 8,900 students, of which approximately 56% were White, 1 1% Hispanic, 15% 



Asian-Pacific Islander, 11% Multiracial, 7% Black, and 1% American Indian- Alaskan Native 
students. About 26% of students were eligible for free or reduced-priced meals. Students in 
District A outperformed their peers in the state on the statewide reading assessment. On average, 
more than 79% of students in grades 3-8 tested proficient on the statewide reading test, compared 
to about 67% for the state. In all, 27 teachers (six in grade 2, four in grade 3, five in grade 4, six 
in grade 5, and six in grade 6) and 715 students participated in the study from District A. 

District B was a large and diverse school district, of approximately 14,000 students, with 
a demographic make-up of approximately 56% White, 15% Hispanic, 11% Asian-Pacific 
Islander, 11% Multiracial, 6% Black, and 2% American Indian-Alaskan Native students. About 
34% of students in the district were eligible for free or reduced-priced meals. In 2010, students 
from District B slightly outperformed their peers in the state on the statewide reading 
assessment. On average, a little fewer than 69% of students in grades 3-8 tested proficient on the 
state reading test, compared to about 67% for the state. Fourth grade showed the largest 
difference between students scoring proficient for the district and state, 72% compared to 67%, 
respectively. Six teachers (two in grades 3 and 7, one in grades 4 and 8) and 317 students 
participated in the study from District B. 

Because we wanted to investigate the reliability for the full grade range of easyCBM R 
MCRC tests, we recruited participants from grades 2-8, with a goal of recruiting six teachers, 
with a corresponding six classes of students, from each of these seven grades. We successfully 
recruited six teachers for grades 2-6. Two teachers were recruited for grade 7 (seven total 
classes of students), and one for grade 8 (three total classes). The average class size across all 
grades was 27 students. Teachers were recruited at the district level and were compensated $150 
for participating in the study. The three participating teachers in grades 7 and 8 were given 



additional stipend money because they administered comprehension measures to more than one 
class of students. All students in attendance on the days the MCRC tests were administered 
participated in the study. 

Multiple-choice Reading Comprehension Measures 

The reading comprehension measures on easy CBM® are designed for group 
administration and are available for grades 2-8. Students first read an original work of narrative 
fiction and then answer multiple-choice questions (12 questions on the grade 2 test, 20 questions 
on each of the grade 3-8 tests) based on the story. Multiple-choice questions are designed to 
assess literal and inferential comprehension on all grade level tests; evaluative comprehension is 
also assessed on the grade 3-8 tests. Each question is comprised of the question stem and three 
possible answer choices: the correct answer and two incorrect but plausible distractors. The 
comprehension measures have a total of 12 points (grade 2) or 20 points (grades 3-8) possible; 
students earn one point for every question they answer correctly. 

We selected the format of the reading comprehension tests based on prior empirical work 
with local school districts (Alonzo & Tindal, 2004a, 2004b, 2004c). In this work, teachers had 
expressed their desire for tests that closely resembled the types of readings students regularly 
encountered in their classes. At the same time, concerns about increasing the reliability, ease of 
use, and cost-effectiveness of our measures prompted us to use selected response rather than 
open-ended question types in our comprehension measures. Accordingly, we developed the 
MCRC tests in a two-step process. First, we wrote the stories that were used as the basis for 
each test. Then, we wrote the test items associated with each story. We embedded quality 
control and content review processes in both these steps throughout instrument development. 

Two people, selected for their expertise in instrument development and language arts, 



were principally involved with overseeing the creation of the MCRC tests. The first person 
oversaw the creation and revision of the stories and test items earned her Bachelor of Arts degree 
in Literature from Carleton College in 1990, worked for twelve years as an English teacher in 
California public schools, was awarded National Board for Professional Teaching Standards 
certification in Adolescent and Young Adulthood English Language Arts in 2002, and earned her 
Ph.D. in the area of Learning Assessments/System Performance at the University of Oregon. 

The second person hired to write the MCRC items earned his Ph.D. in education psychology, 
measurement, and methodology from the University of Arizona. He has worked in education at 
the elementary and middle school levels, as well as in higher education and at the state level. He 
held a position as associate professor in the distance-learning program for Northern Arizona 
University and served as director of assessment for a large metropolitan school district in 
Phoenix, Arizona. In addition, he served as state Director of Assessment and Deputy Associate 
Superintendent for Standards and Assessment at the Arizona Department of Education. He was 
a test development manager for Harcourt Assessment and has broad experience in assessment 
and test development. 

Grade 4 test development. The two individuals hired to develop the grade 4 measures 
worked together to create documentation for story-writers to use while creating their stories. 

This written documentation was provided to increase the comparability of story structure and 
reduce the likelihood of construct irrelevant variance related to variation in story type affecting 
student performance on the different forms of the comprehension measures. Story creation 
specifications provided infonnation about the length of the stories (approximately 1500 words), 
characters, settings, and plots. Stories, which were composed between December 2005 and 
March 2006, were written by a variety of people who were either elementary and secondary 



school teachers or graduate students in the College of Education. In all, 28 stories were written 
for grades 4; 8 did not pass the criteria required for use in the assessment system, leaving 20 to 
be piloted. 

The professional item writer we hired created 20 multiple-choice questions, each with 3 
possible answer options, for each form of the grade 4 MCRC test. In all, he wrote 400 multiple- 
choice questions at the grade 4 level. All fourth-grade questions were written in March and 
April of 2006. For fourth-grade MCRC tests, we wrote seven questions targeting literal 
comprehension, seven questions targeting inferential comprehension, and six questions targeting 
evaluative comprehension, for a total of 20 items on each form of the test. Within each type of 
comprehension, item-writing specifications called for a range of difficulty such that each form of 
each test contained some easy, moderate, and difficult items in each of the types of 
comprehension assessed on that test. Item-writing specifications also guided the ordering of the 
items on each form of the MCRC test. In all cases, we followed a similar pattern of item 
ordering, beginning with the easiest literal comprehension item and continuing with items of 
increasing difficulty, ending with an item designed to be one of the most challenging, pulled 
from the highest level of comprehension assessed in that grade level (evaluative comprehension 
in grade 4). Once the multiple-choice items were written, the stories and item lists were 
formatted into individual tests, each comprised of a story and 20 multiple-choice test items. 
Alonzo, Liu and Tindal (2007) provided a detailed description of the development and technical 
adequacy of the grade 4 MCRC test. 

Grade 4 test selection and administration. We selected a subset of MCRC grade 4 
forms (roughly 60% of those available through the casyCBM ' assessment system) to use in this 
study. We used forms 8, 9, 10, 11, 12, 13, 14, 15, and 16 in this study. We selected the grade 4 



forms because higher form numbers are typically used less in the classroom compared to the 
lower-numbered assessment forms (e.g., forms 1-7) on which we have already completed earlier 
research; thus, we deemed further understanding fonn and item-level reliability statistics of the 
selected fonns a priority. 

Each student participated in the testing on three separate testing occasions in three 
different sessions, roughly one week apart. Each comprehension measure was group 
administered by the classroom teacher. In the first session, students completed a comprehension 
form assigned by class. Roughly one week later, students completed an alternate form of the 
comprehension measure. On the final testing occasion, students completed a third alternate 
form. To reduce the possibility of the order of the forms completed adversely affecting testing 
results, we assigned comprehension forms within a given grade at the class level based on a two- 
group counterbalanced measure design. For instance, the first of the six participating grade 4 
classes completed forms 11, 12, and 13, in that order, over the three testing occasions; the second 
grade 4 class completed fonns 13,12 and 1 1 (the opposite order of the first). We used the same 
counterbalanced measure design for all classes and all grades in the study. 

Analysis 

We used a variety of approaches to study the reliability of the easyCBM comprehension 
assessments: repeated measures analysis of variance, split half reliability using the Guttman 
formula, top/bottom reliability, and Rasch analysis. Each of these analytic approaches is 
explained in more detail in the following section. 

One-way repeated measures analysis of variance. To examine whether there was a 
significant difference in difficulty across the forms, we conducted one-way repeated measures 
analysis of variance (ANOVA). Each student completed three test fonns in each grade. When 



there was a statistically significant within- subject effect, the mean differences among the three 
forms were further analyzed to investigate where the significant within-subject difference 
resided. 

Split-half reliability. We conducted form and item-level reliability analyses for all 
grades in this study. To assess overall reliability of the MCRC measure, we examined the 
internal consistency among items within each selected test form using split-half reliability 
coefficients calculated from the Guttman formula using SPSS 19 (SPSS Inc., 2010). We used 
the Guttman formula to calculate split-half reliability coefficients because the Guttman formula 
does not assume homogeneity of test halves and will not overestimate the full-form reliability 
(Kerlinger & Lee, 2000). Thus, we felt the Guttman formula provided a more conservative and 
reasonable estimate of full test form reliability. 

Top-bottom reliability. We computed the total score based on the scored item-level 
data, with unanswered items scored as incorrect (i.e., “0”). The possible total score for grade 4 
forms is 20. Because casyCBM ' progress monitoring measures were developed to target 
students who are at-risk for academic failure, items should function differently for students who 
are at or below the 23rd percentile (i.e., lower percentiles) and those who are at or above the 78th 
percentile (i.e., higher percentiles). To evaluate the appropriateness of items, item functioning 
was compared between the two aforementioned groups. The scores corresponding to the 23rd 
and 78th percentiles were computed for each form. Then, the proportions of correct responses 
for each item for the two groups were analyzed. Both groups should demonstrate high 
proportions of correct responses for an easy item that functions appropriately. For a difficult 
item that is functioning appropriately, the proportion of correct responses for the lower percentile 
group should be lower than that for the higher percentile group. A higher proportion of correct 



responses for the lower percentile group indicates that the item may not be functioning 
appropriately. 

Rasch analyses. Data from the pilot testing of the MCRC measures were analyzed with 
a one-parameter logistic Rasch analysis using the software Winsteps 3.68.2 (Linacre, 2009). 
Unlike classical statistics, Rasch analyses consider patterns of responses across individuals, 
providing information at a level of specificity in results unattainable with approaches based on 
classical statistics used in the development of most CBMs. In a complex iterative process, a 
Rasch analysis concurrently estimates the difficulty of individual test items and the ability level 
of each individual test taker. The results, relevant to the discussion here, include an estimation 
of the difficulty (referred to as the ‘measure’) of each item, the standard error of measure 
associated with each item’s estimated difficulty, and the degree to which each item ‘fits’ the 
measurement model (referred to as the ‘mean square outfit’). In addition, a Rasch analysis can 
provide information about the average estimated ability of students who selected each of the 
possible answer choices. All of this information must be considered when evaluating the 
technical adequacy of the measures, as described below. 

Considering item estimated difficulty. Rasch analyses, which examine each item’s 
reliability, provide a more precise treatment of reliability than classical statistics, which examine 
the issue from a global test level. The most reliable estimation of a test-taker’s ability can be 
gained from tests comprised of items that represent the fullest range of difficulty possible for the 
population for which the test is intended. Thus, to evaluate the technical adequacy of our MCRC 
measures, we looked for items representing a range of difficulties. In Rasch analyses, this 
information is gleaned from examining each item’s measure. Easy items will have measures 
represented with negative numbers; difficult items will have measures represented with positive 



numbers. A measure of zero indicates an item that a person of average ability would be expected 
to have a 50% chance of getting correct. Thus, we sought a full range of measure on every 
MCRC test form. 

Examining the standard error of measure. Rasch analyses also provide information 
about the standard error of measure associated with the estimation of each item’s measure. 
Generally, the smaller the standard error of measure, the more reliable the estimation is. We 
sought small standard errors of measure for all items on our MCRC tests. 

Using the mean square outfit to evaluate goodness of fit. An additional piece of 
information used to evaluate technical adequacy in a Rasch model is the mean square outfit 
associated with each item. Values in the range of 0.50 to 1.50 are considered acceptable fit. 
Mean square outfits falling outside this acceptable range indicate the need for further evaluation 
of item functioning. In general, items with a mean square outfit less than 0.50 are considered 
less worrisome than items with mean square outfits higher than 1.50 because items falling into 
the fonner category perfonn more consistently (e.g., every student regardless of ability gets the 
item correct or incorrect) compared to items in the latter category that function more 
inconsistently (e.g., students who perform poorly on all other items, always get the item correct) 
(Linacre, 2002). In all cases, distractor analysis provides useful information to further evaluate 
the technical adequacy of each item. 

Analyzing distractor selection information. A distractor analysis provides infonnation 
on the average estimated ability of test takers who selected a particular distractor on a test. In 
evaluating the technical adequacy of an assessment instrument, one hopes to see that the correct 
answer is selected by test-takers with the highest average estimated ability and the remaining 
distractors are selected by test- takers with lower estimated abilities. In addition, every distractor 



in a well-constructed measure will be selected by at least some test-takers. We considered all of 
these features in evaluating the technical adequacy of the MCRC measures. 

Analyzing person and item reliability. Rasch analyses report both the person and item 
reliability. The person reliability is equivalent to the traditional test reliability. Low values 
indicate a narrow range of person measures, or a small number of items. Therefore, testing 
persons with more extreme abilities (high and low) or lengthening the measure would increase 
the person reliability. Winsteps’ item reliability has no traditional equivalent. Low item 
reliability values indicate a narrow range of item measures or a small sample. A larger sample of 
persons would increase item reliability. Low item reliability means that the sample size is too 
small to precisely locate the items on the latent variable (i.e., ability). 

Results 

In this section we report results from split-half and top-bottom reliability, as well as 
Rasch analyses of grade 4 multiple-choice reading comprehension (MCRC) measures from the 
casyCBM ' assessment system. It should be noted that results from grade 4 MCRC fonns 1 1, 12 
and 13 were dropped because the sample size was too small (n = 25 for forms 1 1 and 13, n = 24 
for form 12) for reliability and item-level Rasch analyses to be considered. 

Grade 4 Equivalence by Form 

In this section we report findings concerning the equivalence of MCRC forms. We used 
one-way repeated measures ANOVA to evaluate equivalence of difficulty across the MCRC 
forms. Because like groups of students took three MCRC forms, each roughly one week apart, 
we evaluated the difficulty equivalence of each set of the three forms that were taken by the 
same group of students. Mean differences between forms 8, 9, and 10 as well as forms 14, 15, 
and 16 were not significantly different. Tables 1-4 in Appendix A display descriptive statistics 



and the complete results of repeated measures ANOVA, as well as post-hoc analyses conducted 
to compare mean differences for the grade 4 MCRC measures used in the study. 

Grade 4 Split-half Reliability 

In this section we report overall reliability of the MCRC measure based on internal 
consistency among items within each selected test form using split-half reliability coefficients 
calculated with the Guttman fonnula. Split-half reliability coefficients were computed by 
comparing the results from the first 10 items of the MCRC measure to the second 10 items for all 
students in the sample taking each form. Some coefficients were calculated based on less than 
20 items ( e.g ., form 16 in grade 4) if a given item did not have enough variance to calculate 
reliability. For grade 4 MCRC forms 8, 9, 10, 14, 15 and 16, Guttman split-half reliability 
coefficients ranged from .38 to .67. Specifically, the split-half coefficient for form 8 was .38 (n 
= 20 items); the split-half coefficient for form 9 was .57 (n = 20 items); the split-half coefficient 
for form 10 was .67 (n = 20 items); the split-half coefficient for form 14 was .56 (n = 20 items); 
the split-half coefficient for form 15 was .57 (n = 20 items); the split-half coefficient for form 16 
was .58 (n = 19 items). Tables 1-12 in Appendix B display descriptive statistics and complete 
results of split-half reliability analyses by fonn for grade 4 MCRC measures used in this study. 
Grade 4 Top-bottom Reliability 

In this section we report results from top-bottom reliability analysis used to evaluate the 
appropriateness of items. The proportion of correct responses of each item for low-perfonning 
(at or below the 23rd percentile) and high-performing (at or above the 78th percentile) students 
was evaluated from this analysis to examine the appropriateness of item functioning. For form 8, 
all students in the low-perfonning group answered 1 out of 20 items correctly, and 1 out of 
remaining 19 items incorrectly. The proportion of correct responses for the remaining 18 items 



ranged from .14 to .86. Every student in the high-performing group answered 9 out of 20 items 
correctly, and the proportion of correct responses for the remaining 1 1 items ranged from .42 to 
.92. For form 9, the proportion of correct responses ranged from .13 to .88 for the low- 
performing students. Every student in the high-performing group answered 5 out of 20 items 
correctly, and the proportion of correct responses for the remaining 15 items ranged from .50 to 
.94. For form 10, the proportion of correct responses ranged from .1 1 to .89 for the low- 
performing students. All students in the high-performing group answered 6 out of 20 items 
correctly; the proportion of correct responses for the remaining 14 items ranged from .47 to .93. 

For form 14, the proportion of correct responses ranged from .25 to .88 for the low- 
performing students. All students in the high-performing group answered 8 out of 20 items 
correctly, and the proportion of correct responses for the remaining 12 items ranged from .33 to 
.93. The proportion of correct responses for item #9 was higher for the low-performing students 
(.38) than the high-performing students (.35). For form 15, the proportion of correct responses 
ranged from .18 to .91 for the low-perfonning students. All students in the high-performing 
group answered 9 out of 20 items correctly, and the proportion of correct responses for the 
remaining 1 1 items ranged from .50 to .90. The proportion of correct responses for item #1 was 
higher for the low-performing students (.91) than the high-performing students (.90), though this 
difference is deemed very small given the sample size. For fonn 16, all students in the low- 
performing group answered 2 out of 20 items correctly, but 1 out of the remaining 18 items 
incorrectly. The proportion of correct responses ranged from .14 to .86 for the remaining 17 
items. Every student in the high-performing group answered 5 out of 20 items correctly, and the 
proportion of correct responses for the remaining 15 items ranged from .37 to .95. Tables 1-10 



in Appendix C display mean and percentile scores and the complete top-bottom reliability results 
for the grade 4 MCRC forms used in this study. 

Grade 4 Item-level Rasch Analyses 

Almost all items on the grade 4 MCRC form 8 test form passed the model fit selection 
criteria, falling within the mean square outfit range of 0.5 to 1.5. Items #3 and #4 exceeded the 
model fit selection criteria with mean square outfit of 1.80 and 1.65, respectively. Items #1 and 
#9 had mean square outfit of 0.46 and 0.41, falling below the criteria of 0.5. Distractor analysis 
indicated that these four items were functioning appropriately. Most of the items on grade 4 
MCRC form 9 were within the mean square outfit range of 0.5 to 1.50. Items #6 and #15 were 
over-fit, with mean square outfit of 2.40 and 1.63, respectively. Items #7, #14 and #16 were 
under-fit, with mean square outfit values of 0.46, 0.33, and 0.30, respectively. Analysis of the 
distractors, however, indicated only item #15 was not functioning appropriately. On grade 4 
MCRC form 10, item #11 was over- fit, with a mean square outfit of 1.67. Item #3 was under- fit, 
with a mean square outfit value of 0.08. Analysis of the distractors indicated that both items 
were not functioning appropriately. 

All items on the grade 4 MCRC form 14 had mean square outfit within the acceptable 
criteria of 0.50 to 1.50, with the exception of item #8. Item #8 was over-fit, with a mean square 
outfit of 1.55. Distractor analysis, however, indicated that item #8 was functioning 
appropriately, though items #6 and #9 may not be. Almost all of the items on the grade 4 MCRC 
form 15 had mean square outfit within the acceptable criteria of 0.50 to 1.50. Item #1 had a 
mean square outfit of 2.00. Distractor analysis also suggested that this item was not functioning 
properly. On test fonn grade 4 MCRC form 16, three items did not meet the acceptable criteria 
of mean square outfit of 0.50 to 1.50. The two items, #10 and #19, were over-fit, with mean 



square outfit of 1.63 and 1.90, respectively. Item #3 was under-fit, with a mean square outfit of 
0.37. Results from the distractor analysis indicated that only item #3 was not functioning 
appropriately. 

Person reliability values were moderate overall, ranging from .56 to .67. Item reliability 
was generally high, ranging from .86 to .92. Tables 1-12 in Appendix D display the item 
measure, standard error of measure, mean square outfit, and complete distractor analyses for the 
six grade 4 MCRC measures used in this study. 

Discussion 

Our findings in this study suggest that the grade 2 easyCBM multiple choice reading 
comprehension measures have acceptable levels of reliability for the two purposes for which 
they are intended: as one part of a battery of assessments administered in the fall, winter, and 
spring to screen students at risk for reading difficulty, and as repeated measures over time as 
used to monitor student progress in developing comprehension skill. Although we would have 
preferred to have even higher alternate fonn reliability coefficients, it appears likely that our 
results are dampened by two factors. First, sample sizes in our study were not as large as we 
would have liked, due to challenges related to participant recruitment. Second, it appears as 
though the test forms might have had a ceiling effect, with very little variation in scores for 
students who were in the top third (reducing the power of the top / bottom reliability analyses). 


Future research should address both these limitations. 



References 


Alonzo, J., Ketterlin-Geller, L.R., & Tindal, G. (2006). Curriculum-based measurement in 

reading and math: providing rigorous outcomes to support learning. In L. Florian (Ed.), 
The Sage Handbook of Special Education (pp. 307-318). Thousand Oaks, CA: Sage. 

Alonzo, J., Liu, K., & Tindal, G. (2007). Examining the technical adequacy of reading 

comprehension measures in a progress monitoring assessment system (Technical Report 
No. 41). Eugene, OR: Behavioral Research and Teaching, University of Oregon. 

Alonzo, J., & Tindal, G. (2004a). Analysis of reading fluency and comprehension measures for 
first-grade students (Technical Report No. 25). Eugene, OR: University of Oregon, 
College of Education, Behavioral Research and Teaching. 

Alonzo, J., & Tindal, G. (2004b). Analysis of reading fluency and comprehension measures for 
fourth-grade students (Technical Report No. 27). Eugene, OR: University of Oregon, 
College of Education, Behavioral Research and Teaching. 

Alonzo, J., & Tindal, G. (2004c). Technical report: District reading assessments, spring 2004 

administration (Technical Report No. 30). Eugene, OR: University of Oregon, College of 
Education, Behavioral Research and Teaching. 

Kerlinger, F. N., & Lee, H. B. (2000). Foundations of Behavioral Research (Fourth ed.). New 
York: Thomspon Learning, Inc. 

Linacre, J. M. (2002). What do inlit and outfit, mean-square and standardized mean? Rasch 
Measurement and Transactions, 16(2), 878. 

Linacre, J. M. (2009). WINSTEPS Rasch measurement computer program: version 3.68.2. 

SPSS Inc. (2010). SPSS for Macintosh License Agreement. Chicago, IL: SPSS Inc. 



Yovanoff, P., Duesbery, L., Alonzo, J., & Tindal, G. (2005). Grade level invariance of a 

theoretical causal structure predicting reading comprehension with vocabulary and oral 
reading fluency. Educational Measurement: Issues and Practice, 4-12. 



Appendix A 


p. 17 


Table 1 


Descriptive Statistics of Grade 4 Measures 8 to 10 



Mean 


Std. Deviation 

N 


total 8 


14.3636 


2.98127 


44 

total 9 


13.9545 


2.95670 


44 

total 10 


13.6818 


3.21170 


44 

Table 2 







Tests ofWithin-Subjects Effects for Grade 4 Measures 8 to 10 






Type III Sum of 





Source 


Squares 

df 

Mean Square 

F 

Sift. 

forms 

Sphericity Assumed 

10.364 

2 

5.182 

1.436 

.243 


Greenhouse-Geisser 

10.364 

1.924 

5.385 

1.436 

.244 


Huynh-Feldt 

10.364 

2.000 

5.182 

1.436 

.243 


Lower-bound 

10.364 

1.000 

10.364 

1.436 

.237 

Error(forms) 

Sphericity Assumed 

310.303 

86 

3.608 




Greenhouse-Geisser 

310.303 

82.749 

3.750 




Huynh-Feldt 

310.303 

86.000 

3.608 




Lower-bound 

310.303 

43.000 

7.216 




2 

Note. Mauchly’s Test of Sphericity: The assumption of sphericity was not violated, Mauchly's W was 0.96, % (2) = 


1.68, p > .05 



Table 3 

Descriptive Statistics of Grade 4 Measures 14 to 16 


p. 18 



Mean 

Std. Deviation 

N 


total 14 

13.0851 

2.99151 


47 

total 15 

13.6170 

3.28735 


47 

total 16 

12.7872 

3.20283 


47 


Table 4 


Tests of Within-Subjects Effects for Grade 4 Measures 14 to 16 


Source 


Type III Sum of 
Squares 

df 

Mean Square 

F 

Sig. 

forms 

Sphericity Assumed 

16.610 

2 

8.305 

1.353 

.264 


Greenhouse-Geisser 

16.610 

1.967 

8.444 

1.353 

.264 


Huynh-Feldt 

16.610 

2.000 

8.305 

1.353 

.264 


Lower-bound 

16.610 

1.000 

16.610 

1.353 

.251 

Error(forms) 

Sphericity Assumed 

564.723 

92 

6.138 




Greenhouse-Geisser 

564.723 

90.485 

6.241 




Huynh-Feldt 

564.723 

92.000 

6.138 




Lower-bound 

564.723 

46.000 

12.277 




2 

Note. Mauchly ’s Test of Sphericity: The assumption of sphericity was not violated, Mauchly's W was 0.98, % (2) = 


0.16, p > .05. 



Appendix B 


Table 1 

Grade 4 Split-Half Coefficients for MCRC Form 8 with N = 20 Items 


Cronbach's Alpha 

Part 1 Value 

.622 


N of Items 

10 a 


Part 2 Value 

.511 


N of Items 

10 b 


Total N of Items 

20 

Correlation Between Forms 


.237 

Spearman-Brown Coefficient 

Equal Length 

.384 


Unequal Length 

.384 

Guttman Split-Half Coefficient 


.383 


a. The items are: QlCorr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr, QlOCorr. 

b. The items are: Q1 ICorr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, Q16_Corr, 
Q17_Corr, Q18_Corr, Q19_Corr, Q20_Corr. 

Table 2 


Grade 4 Split-Half Scale Statistics for MCRC Form 8 with N = 20 Items 


Mean 

Variance 

Std. Deviation 

N of Items 

Part 1 

7.70 

3.492 

1.869 

10 a 

Part 2 

7.16 

3.251 

1.803 

10 b 

Both Parts 

14.86 

8.342 

2.888 

20 


a. The items are: Ql Corr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr, QlO Corr. 

b. The items are: Q1 I Corr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, Q16_Corr, 
Q17_Corr, Q18_Corr, Q19_Corr, Q20_Corr. 



Table 3 

Grade 4 Split-Half Coefficients for MCRC Form 9 with N = 20 Items 


Cronbach's Alpha 

Part 1 Value 

.496 


N of Items 

10 a 


Part 2 Value 

.366 


N of Items 

10 b 


Total N of Items 

20 

Correlation Between Forms 


.404 

Spearman-Brown Coefficient 

Equal Length 

.576 


Unequal Length 

.576 

Guttman Split-Half Coefficient 


.574 


a. The items are: QlCorr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr, QlOCorr. 

b. The items are: Q1 ICorr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, Q16_Corr, 
Q17_Corr, Q18_Corr, Q19_Corr, Q20_Corr. 

Table 4 


Grade 4 Split-Half Scale Statistics for 

MCRC Form 9 with N = 20 Items 



Mean 

Variance 

Std. Deviation 

N of Items 

Part 1 

7.34 

3.067 

1.751 

10 a 

Part 2 

6.75 

2.564 

1.601 

10 b 

Both Parts 

14.09 

7.899 

2.810 

20 


a. The items are: Ql Corr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr, QlO Corr. 

b. The items are: Q1 I Corr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, Q16_Corr, 
Q17_Corr, Q18_Corr, Q19_Corr, Q20_Corr. 



Table 5 

Grade 4 Split-Half Coefficients for MCRC Form 10 with N = 20 Items 


Cronbach's Alpha 

Part 1 Value 

.486 


N of Items 

10 a 


Part 2 Value 

.423 


N of Items 

10 b 


Total N of Items 

20 

Correlation Between Forms 


.506 

Spearman-Brown Coefficient 

Equal Length 

.672 


Unequal Length 

.672 

Guttman Split-Half Coefficient 


.672 


a. The items are: QlCorr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr, QlOCorr. 

b. The items are: Q1 ICorr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, Q16_Corr, 
Q17_Corr, Q18_Corr, Q19_Corr, Q20_Corr. 

Table 6 


Grade 4 Split-Half Scale Statistics for MCRC Form 10 with A = 20 Items 


Mean 

Variance 

Std. Deviation 

N of Items 

Part 1 

7.15 

3.186 

1.785 

10 a 

Part 2 

7.18 

3.046 

1.745 

10 b 

Both Parts 

14.33 

9.386 

3.064 

20 


a. The items are: Ql Corr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr, QlO Corr. 

b. The items are: Q1 I Corr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, Q16_Corr, 
Q17_Corr, Q18_Corr, Q19_Corr, Q20_Corr. 



Table 7 

Grade 4 Split-Half Coefficients for MCRC Form 14 with N = 20 Items 


Cronbach's Alpha 

Part 1 Value 

.318 


N of Items 

10 a 


Part 2 Value 

.465 


N of Items 

10 b 


Total N of Items 

20 

Correlation Between Forms 


.391 

Spearman-Brown Coefficient 

Equal Length 

.562 


Unequal Length 

.562 

Guttman Split-Half Coefficient 


.557 


a. The items are: QlCorr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr, QlOCorr. 

b. The items are: Q1 I Corr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, Q16_Corr, 
Q17_Corr, Q18_Corr, Q19_Corr, Q20_Corr. 

Table 8 


Grade 4 Split-Half Scale Statistics for MCRC Form 14 with N = 20 Items 


Mean 

Variance 

Std. Deviation 

N of Items 

Part 1 

7.24 

2.371 

1.540 

10 a 

Part 2 

6.20 

3.255 

1.804 

10 b 

Both Parts 

13.44 

7.798 

2.792 

20 


a. The items are: Ql Corr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr, QlO Corr. 

b. The items are: Q1 I Corr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, Q16_Corr, 
Q17_Corr, Q18_Corr, Q19_Corr, Q20_Corr. 



Table 9 

Grade 4 Split-Half Coefficients for MCRC Form 15 with N = 20 Items 


Cronbach's Alpha 

Part 1 Value 

.478 


N of Items 

10 a 


Part 2 Value 

.556 


N of Items 

10 b 


Total N of Items 

20 

Correlation Between Forms 


.399 

Spearman-Brown Coefficient 

Equal Length 

.571 


Unequal Length 

.571 

Guttman Split-Half Coefficient 


.568 


a. The items are: QlCorr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr, QlOCorr. 

b. The items are: Q1 ICorr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, Q16_Corr, 
Q17_Corr, Q18_Corr, Q19_Corr, Q20_Corr. 

Table 10 


Grade 4 Split-Half Scale Statistics for MCRC Form 15 with N = 20 Items 


Mean 

Variance 

Std. Deviation 

N of Items 

Part 1 

6.98 

3.074 

1.753 

10 a 

Part 2 

6.90 

3.840 

1.960 

10 b 

Both Parts 

13.88 

9.660 

3.108 

20 


a. The items are: Ql Corr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr, QlO Corr. 

b. The items are: Q1 I Corr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, Q16_Corr, 
Q17_Corr, Q18_Corr, Q19_Corr, Q20_Corr. 



Table 1 1 

Grade 4 Split-Half Coefficients for MCRC Form 16 with N = 19 Items 


Cronbach's Alpha 

Part 1 Value 

.091 


N of Items 

9 a 


Part 2 Value 

.308 


N of Items 

10 b 


Total N of Items 

19 

Correlation Between Forms 


.431 

Spearman-Brown Coefficient 

Equal Length 

.603 


Unequal Length 

.603 

Guttman Split-Half Coefficient 


.584 


a. The items are: QlCorr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr. 

b. The items are: QlOCorr, Q1 ICorr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, 
Q16_Corr, Q17_Corr, Q18_Corr, Q19_Corr. 


Table 12 

Grade 4 Split-Half Scale Statistics for MCRC Form 16 with N = 19 Items 

Mean Variance Std. Deviation N of Items 


Part 1 

6.55 

1.513 

1.230 

9 a 

Part 2 

5.98 

2.760 

1.661 

10 b 

Both Parts 

12.53 

6.037 

2.457 

19 


a. The items are: Ql Corr, Q2_Corr, Q3_Corr, Q4_Corr, Q5_Corr, Q6_Corr, Q7_Corr, 
Q8_Corr, Q9_Corr. 

b. The items are: QlO Corr, Q1 I Corr, Q12_Corr, Q13_Corr, Q14_Corr, Q15_Corr, 
Q16_Corr, Q17_Corr, Q18_Corr, Q19_Corr. 



Appedix C 


p. 25 


Table 1 


Grade 4 Mean and the Percentile Scores by Form 


Form 

Mean (n) 

23 rd Percentile («) 

78 th Percentile («) 

8 

13.98 (49) 

12(7) 

17(12) 

9 

13.76 (42) 

11(8) 

16(16) 

10 

13.71 (49) 

12(9) 

16(15) 

14 

13.10(51) 

11(8) 

15 (15) 

15 

13.55 (49) 

11(11) 

17(10) 

16 

12.66 (53) 

10(7) 

15(19) 
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Table 2 

Item Statistics for Students for Grade 4 Form 8 



23 rd Percentile or Below 

78 * 

Percentile or Above 


Mean 

Std. Deviation 

N 

Mean 

Std. Deviation 

N 

QlCorr 

.71 

.488 

7 

1.00 

.000 

12 

Q2_Corr 

.29 

.488 

7 

1.00 

.000 

12 

Q3_Corr 

.86 

.378 

7 

.83 

.389 

12 

Q4_Corr 

.29 

.488 

7 

.42 

.515 

12 

Q5_Corr 

.71 

.488 

7 

1.00 

.000 

12 

Q6_Corr 

.57 

.535 

7 

1.00 

.000 

12 

Q7_Corr 

.29 

.488 

7 

.83 

.389 

12 

Q8_Corr 

.29 

.488 

7 

1.00 

.000 

12 

Q9_Corr 

.86 

.378 

7 

1.00 

.000 

12 

QlOCorr 

.14 

.378 

7 

.83 

.389 

12 

QllCorr 

.57 

.535 

7 

.92 

.289 

12 

Q 1 2_Corr 

.71 

.488 

7 

1.00 

.000 

12 

Q13_Corr 

.57 

.535 

7 

.75 

.452 

12 

Q 1 4_Corr 

.29 

.488 

7 

.92 

.289 

12 

Q15_Corr 

1.00 

.000 

7 

1.00 

.000 

12 

Q 1 6_Corr 

.86 

.378 

7 

1.00 

.000 

12 

Q 1 7_Corr 

.57 

.535 

7 

.92 

.289 

12 

Q 1 8_Corr 

.00 

.000 

7 

.67 

.492 

12 

Q 1 9_Corr 

.57 

.535 

7 

.92 

.289 

12 

Q20 Corr 

.29 

.488 

7 

.92 

.289 

12 
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Table 3 

Item Statistics for Students for Grade 4 Form 9 




23 rd Percentile or Below 


78 th Percentile or Above 


Mean 

Std. Deviation 

N 

Mean 

Std. Deviation 

N 

QlCorr 

.75 

.463 

8 

1.00 

.000 

16 

Q2_Corr 

.38 

.518 

8 

.87 

.342 

16 

Q3_Corr 

.75 

.463 

8 

.94 

.250 

16 

Q4_Corr 

.38 

.518 

8 

.69 

.479 

16 

Q5_Corr 

.75 

.463 

8 

.81 

.403 

16 

Q6_Corr 

.63 

.518 

8 

.94 

.250 

16 

Q7_Corr 

.75 

.463 

8 

1.00 

.000 

16 

Q8_Corr 

.50 

.535 

8 

.94 

.250 

16 

Q9_Corr 

.13 

.354 

8 

.75 

.447 

16 

QlOCorr 

.38 

.518 

8 

.94 

.250 

16 

QllCorr 

.50 

.535 

8 

.87 

.342 

16 

Q 1 2_Corr 

.13 

.354 

8 

.75 

.447 

16 

Q13_Corr 

.88 

.354 

8 

1.00 

.000 

16 

Q 1 4_Corr 

.75 

.463 

8 

1.00 

.000 

16 

Q15_Corr 

.38 

.518 

8 

.50 

.516 

16 

Q 1 6_Corr 

.75 

.463 

8 

1.00 

.000 

16 

Q 1 7_Corr 

.25 

.463 

8 

.75 

.447 

16 

Q 1 8_Corr 

.13 

.354 

8 

.63 

.500 

16 

Q 1 9_Corr 

.50 

.535 

8 

.87 

.342 

16 

Q20 Corr 

.13 

.354 

8 

.56 

.512 

16 
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Table 4 

Item Statistics for Students for Grade 4 Form 10 




23 rd Percentile or Below 


78 th Percentile or Above 


Mean 

Std. Deviation 

N 

Mean 

Std. Deviation 

N 

QlCorr 

.56 

.527 

9 

1.00 

.000 

15 

Q2_Corr 

.67 

.500 

9 

1.00 

.000 

15 

Q3_Corr 

.89 

.333 

9 

1.00 

.000 

15 

Q4_Corr 

.56 

.527 

9 

.87 

.352 

15 

Q5_Corr 

.44 

.527 

9 

.87 

.352 

15 

Q6_Corr 

.22 

.441 

9 

.47 

.516 

15 

Q7_Corr 

.44 

.527 

9 

1.00 

.000 

15 

Q8_Corr 

.67 

.500 

9 

.80 

.414 

15 

Q9_Corr 

.22 

.441 

9 

1.00 

.000 

15 

QlOCorr 

.33 

.500 

9 

.67 

.488 

15 

QllCorr 

.78 

.441 

9 

.73 

.458 

15 

Q 1 2_Corr 

.56 

.527 

9 

.47 

.516 

15 

Q13_Corr 

.33 

.500 

9 

.87 

.352 

15 

Q 1 4_Corr 

.56 

.527 

9 

.93 

.258 

15 

Q15_Corr 

.56 

.527 

9 

1.00 

.000 

15 

Q 1 6_Corr 

.78 

.441 

9 

.80 

.414 

15 

Q 1 7_Corr 

.11 

.333 

9 

.53 

.516 

15 

Q 1 8_Corr 

.67 

.500 

9 

.93 

.258 

15 

Q 1 9_Corr 

.11 

.333 

9 

.87 

.352 

15 

Q20 Corr 

.44 

.527 

9 

.93 

.258 

15 
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Table 5 

Item Statistics for Students for Grade 4 Form 14 




23 rd Percentile or Below 


78 th Percentile or Above 


Mean 

Std. Deviation 

N 

Mean 

Std. Deviation 

N 

QlCorr 

.75 

.463 

8 

1.00 

.000 

15 

Q2_Corr 

.88 

.354 

8 

.93 

.258 

15 

Q3_Corr 

.63 

.518 

8 

1.00 

.000 

15 

Q4_Corr 

.25 

.463 

8 

.53 

.516 

15 

Q5_Corr 

.50 

.535 

8 

1.00 

.000 

15 

Q6_Corr 

.63 

.518 

8 

.60 

.507 

15 

Q7_Corr 

.38 

.518 

8 

1.00 

.000 

15 

Q8_Corr 

.25 

.463 

8 

1.00 

.000 

15 

Q9_Corr 

.38 

.518 

8 

.33 

.488 

15 

QlOCorr 

.25 

.463 

8 

.87 

.352 

15 

QllCorr 

.25 

.463 

8 

.93 

.258 

15 

Q 1 2_Corr 

.25 

.463 

8 

.40 

.507 

15 

Q13_Corr 

.25 

.463 

8 

.67 

.488 

15 

Q 1 4_Corr 

.63 

.518 

8 

1.00 

.000 

15 

Q15_Corr 

.25 

.463 

8 

.93 

.258 

15 

Q 1 6_Corr 

.88 

.354 

8 

1.00 

.000 

15 

Q 1 7_Corr 

.38 

.518 

8 

.87 

.352 

15 

Q 1 8_Corr 

.25 

.463 

8 

.33 

.488 

15 

Q 1 9_Corr 

.75 

.463 

8 

1.00 

.000 

15 

Q20 Corr 

.25 

.463 

8 

.87 

.352 

15 
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Table 6 

Item Statistics for Students for Grade 4 Form 15 




23 rd Percentile or Below 


78 th Percentile or Above 


Mean 

Std. Deviation 

N 

Mean 

Std. Deviation 

N 

QlCorr 

.91 

.302 

11 

.90 

.316 

10 

Q2_Corr 

.73 

.467 

11 

1.00 

.000 

10 

Q3_Corr 

.91 

.302 

11 

1.00 

.000 

10 

Q4_Corr 

.64 

.505 

11 

1.00 

.000 

10 

Q5_Corr 

.45 

.522 

11 

.80 

.422 

10 

Q6_Corr 

.45 

.522 

11 

1.00 

.000 

10 

Q7_Corr 

.27 

.467 

11 

.50 

.527 

10 

Q8_Corr 

.36 

.505 

11 

.90 

.316 

10 

Q9_Corr 

.18 

.405 

11 

.90 

.316 

10 

QlOCorr 

.09 

.302 

11 

.50 

.527 

10 

QllCorr 

.09 

.302 

11 

.70 

.483 

10 

Q 1 2_Corr 

.64 

.505 

11 

1.00 

.000 

10 

Q13_Corr 

.64 

.505 

11 

1.00 

.000 

10 

Q 1 4_Corr 

.18 

.405 

11 

1.00 

.000 

10 

Q15_Corr 

.82 

.405 

11 

1.00 

.000 

10 

Q 1 6_Corr 

.82 

.405 

11 

1.00 

.000 

10 

Q 1 7_Corr 

.36 

.505 

11 

.90 

.316 

10 

Q 1 8_Corr 

.36 

.505 

11 

.70 

.483 

10 

Q 1 9_Corr 

.55 

.522 

11 

.90 

.316 

10 

Q20 Corr 

.36 

.505 

11 

.90 

.316 

10 
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Table 7 

Item Statistics for Students for Grade 4 Form 16 



23 rd Percentile or Below 

78 * 

Percentile or Above 


Mean 

Std. Deviation 

N 

Mean 

Std. Deviation 

N 

QlCorr 

.71 

.488 

7 

1.00 

.000 

19 

Q2_Corr 

.86 

.378 

7 

.95 

.229 

19 

Q3_Corr 

1.00 

.000 

7 

1.00 

.000 

19 

Q4_Corr 

.57 

.535 

7 

.84 

.375 

19 

Q5_Corr 

1.00 

.000 

7 

1.00 

.000 

19 

Q6_Corr 

.14 

.378 

7 

.74 

.452 

19 

Q7_Corr 

.43 

.535 

7 

.74 

.452 

19 

Q8_Corr 

.57 

.535 

7 

.95 

.229 

19 

Q9_Corr 

.43 

.535 

7 

.89 

.315 

19 

QlOCorr 

.14 

.378 

7 

.42 

.507 

19 

QllCorr 

.71 

.488 

7 

.63 

.496 

19 

Q 1 2_Corr 

.29 

.488 

7 

.79 

.419 

19 

Q13_Corr 

.29 

.488 

7 

.68 

.478 

19 

Q 1 4_Corr 

.71 

.488 

7 

1.00 

.000 

19 

Q15_Corr 

.29 

.488 

7 

.47 

.513 

19 

Q 1 6_Corr 

.43 

.535 

7 

1.00 

.000 

19 

Q 1 7_Corr 

.14 

.378 

7 

.74 

.452 

19 

Q 1 8_Corr 

.00 

.000 

7 

.95 

.229 

19 

Q 1 9_Corr 

.43 

.535 

7 

.68 

.478 

19 

Q20 Corr 

.14 

.378 

7 

.37 

.496 

19 
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Table 1 

Item Statistics, Entry Order, Grade 4, Form 8 


Item Number 

Raw Score 

Count 

Measure 

Model Standard 
Error 

Mean Square 
Outfit 

1 

47 

49 

-2.53 

-0.22 

0.46 

2 

40 

49 

-0.61 

-0.23 

0.86 

3 

40 

49 

-0.61 

1.71 

1.80 

4 

20 

49 

1.70 

2.48 

1.65 

5 

43 

49 

-1.17 

-0.84 

0.52 

6 

37 

49 

-0.16 

0.18 

1.02 

7 

32 

49 

0.44 

-0.25 

0.93 

8 

35 

49 

0.09 

-1.05 

0.72 

9 

45 

49 

-1.70 

-0.76 

0.41 

10 

25 

49 

1.18 

-1.37 

0.75 

11 

31 

49 

0.55 

0.16 

1.02 

12 

39 

49 

-0.45 

-0.99 

0.64 

13 

30 

49 

0.66 

0.79 

1.15 

14 

29 

49 

0.77 

-1.51 

0.72 

15 

42 

49 

-0.96 

-0.10 

0.87 

16 

42 

49 

-0.96 

-0.68 

0.63 

17 

26 

49 

1.08 

0.85 

1.15 

18 

17 

49 

2.03 

0.62 

1.15 

19 

38 

49 

-0.30 

0.41 

1.11 

20 

27 

49 

0.97 

-0.46 

0.90 



Table 2 


Distractor Analysis, Grade 4, Form 8 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. Mean 


A 

0 

1 

2 

-0.24 

0.00 

1 

C 

0 

1 

2 

-0.24 

0.00 

B 

1 

47 

96 

1.30 

0.17 


Missing 

** 






B 

0 

1 

2 

-0.50 

0.00 


A 

0 

8 

16 

0.42 

0.23 

L 

C 

1 

40 

82 

1.44 

0.19 


Missing 

** 






A 

0 

2 

4 

1.86 

0.78 


C 

0 

7 

14 

0.76 

0.39 


B 

1 

40 

82 

1.29 

0.19 


Missing 

** 






A 

0 

2 

4 

0.94 

1.18 

A 

C 

1 

20 

41 

1.44 

0.29 


B 

0 

27 

55 

1.10 

0.22 


Missing 

** 






A 

0 

1 

2 

-1.05 

0.00 

C 

C 

0 

4 

8 

0.22 

0.31 


B 

1 

43 

88 

1.46 

0.16 


Missing 

** 

1 

2 

-2.14 

0.00 


B 

0 

3 

6 

0.20 

0.44 


C 

0 

6 

12 

1.16 

0.22 

0 

A 

1 

37 

76 

1.45 

0.19 


Missing 

** 

3 

6 

-0.26 

0.97 


C 

0 

7 

14 

0.21 

0.33 

n 

B 

0 

8 

16 

1.04 

0.34 

/ 

A 

1 

32 

65 

1.65 

0.18 


Missing 

** 

2 

4 

-1.06 

1.08 


B 

0 

1 

2 

1.38 

0.00 

8 

A 

0 

11 

22 

0.38 

0.22 

C 

1 

35 

71 

1.63 

0.18 


Missing 

** 

2 

4 

-1.06 

1.08 


B 

0 

1 

2 

0.27 

0.00 

o 

C 

0 

2 

4 

-0.24 

0.00 

y 

A 

1 

45 

92 

1.40 

0.16 


Missing 

** 

1 

2 

-2.14 

0.00 


C 

0 

3 

6 

1.43 

0.35 

10 

A 

0 

16 

33 

0.70 

0.18 

B 

1 

25 

51 

1.90 

0.20 


Missing 

** 

5 

10 

-0.53 

0.48 



Table 2 


Distractor Analysis, Grade 4, Form 8 (Continued) 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. 

Mean 


B 

0 

2 

4 

1.40 

0.32 

11 

A 

0 

11 

22 

0.98 

0.27 

C 

1 

31 

63 

1.60 

0.20 


Missing 

** 

5 

10 

-0.53 

0.48 


C 

0 

1 

2 

-0.24 

0.00 

12 

B 

0 

5 

10 

0.69 

0.14 

A 

1 

39 

80 

1.54 

0.17 


Missing 

** 

4 

8 

-0.72 

0.57 


A 

0 

6 

12 

1.07 

0.36 

13 

C 

0 

8 

16 

1.17 

0.31 

B 

1 

30 

61 

1.56 

0.21 


Missing 

** 

5 

10 

-0.42 

0.54 


C 

0 

4 

8 

0.41 

0.31 

14 

B 

0 

14 

29 

0.67 

0.2 

A 

1 

29 

59 

1.82 

0.18 


Missing 

** 

2 

4 

-1.59 

0.54 


A 

0 

1 

2 

1.08 

0.00 

15 

B 

0 

4 

8 

0.70 

0.38 

C 

1 

42 

86 

1.42 

0.17 


Missing 

** 

2 

4 

-1.59 

0.54 


B 

0 

2 

4 

0.14 

0.13 

16 

A 

0 

3 

6 

0.71 

0.24 

C 

1 

42 

86 

1.46 

0.16 


Missing 

** 

2 

4 

-1.59 

0.54 


B 

0 

4 

8 

0.62 

0.40 

17 

C 

0 

15 

31 

1.19 

0.22 

A 

1 

26 

53 

1.59 

0.23 


Missing 

** 

4 

8 

-0.26 

0.80 


C 

0 

4 

8 

0.15 

0.35 

18 

B 

1 

17 

35 

1.84 

0.31 

A 

0 

24 

49 

1.17 

0.15 


Missing 

** 

4 

8 

0.13 

1.07 


A 

0 

2 

4 

0.40 

0.13 

19 

B 

0 

6 

12 

0.87 

0.48 

C 

1 

38 

78 

1.49 

0.17 


Missing 

** 

3 

6 

-0.70 

0.94 


C 

0 

4 

8 

0.66 

0.08 

20 

A 

0 

15 

31 

0.84 

0.21 

B 

1 

27 

55 

1.75 

0.21 


Missing 

** 

3 

6 

-0.70 

0.94 
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Table 3 


Item Statistics, Entry Order, Grade 4 , Form 9 


Item Number 

Raw 

Score 

Count 

Measure 

Model Standard 
Error 

Mean Square 
Outfit 

1 

41 

46 

-1.33 

-0.35 

0.69 

2 

32 

46 

0.16 

-0.43 

0.87 

3 

35 

46 

-0.24 

0.21 

1.03 

4 

30 

46 

0.39 

1.73 

1.42 

5 

38 

46 

-0.71 

0.91 

1.37 

6 

39 

46 

-0.89 

2.26 

2.40 

7 

41 

46 

-1.33 

-0.90 

0.46 

8 

37 

46 

-0.54 

-0.06 

0.93 

9 

16 

46 

1.90 

1.19 

1.28 

10 

25 

46 

0.94 

-0.80 

0.86 

11 

37 

46 

-0.54 

0.37 

1.10 

12 

22 

46 

1.25 

-0.56 

0.90 

13 

41 

46 

-1.33 

-0.79 

0.50 

14 

42 

46 

-1.61 

-1.04 

0.33 

15 

19 

46 

1.57 

2.81 

1.63 

16 

42 

46 

-1.61 

-1.14 

0.30 

17 

30 

46 

0.39 

-0.27 

0.93 

18 

16 

46 

1.90 

0.57 

1.12 

19 

32 

46 

0.16 

0.58 

1.13 

20 

20 

46 

1.46 

0.06 

1.00 



Table 4 


Distractor Analysis, Grade 4, Form 9 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. 

Mean 


B 

0 

2 

4 

0.24 

0.00 

1 

C 

0 

3 

7 

0.25 

0.41 

A 

1 

41 

89 

1.25 

0.16 


Missing 

** 






B 

0 

4 

9 

0.63 

0.47 


A 

0 

10 

22 

0.31 

0.35 

L 

C 

1 

32 

70 

1.46 

0.15 


Missing 

** 






B 

0 

3 

7 

0.26 

0.47 

'l 

C 

0 

8 

17 

0.79 

0.30 


A 

1 

35 

76 

1.29 

0.18 


Missing 

** 






A 

0 

7 

15 

1.44 

0.29 

A 

C 

0 

9 

20 

0.58 

0.35 

‘y 

B 

1 

30 

65 

1.24 

0.19 


Missing 

** 






B 

0 

2 

4 

0.96 

0.45 

c 

A 

0 

6 

13 

0.82 

0.48 


C 

1 

38 

83 

1.20 

0.17 


Missing 

** 






C 

0 

3 

7 

-0.29 

0.56 


A 

0 

4 

9 

1.15 

0.88 

O 

B 

1 

39 

85 

1.25 

0.14 


Missing 

** 






C 

0 

1 

2 

-0.55 

0.00 

n 

B 

0 

3 

7 

-0.03 

0.54 

/ 

A 

1 

41 

89 

1.33 

0.14 


Missing 

** 

1 

2 

-1.41 

0.00 


A 

0 

3 

7 

0.62 

0.33 

8 

C 

0 

5 

11 

0.37 

0.58 

B 

1 

37 

80 

1.35 

0.14 


Missing 

** 

1 

2 

-1.41 

0.00 


B 

0 

13 

28 

0.83 

0.2 

o 

A 

0 

16 

35 

0.95 

0.14 

y 

C 

1 

16 

35 

1.73 

0.3 


Missing 

** 

1 

2 

-1.41 

-1.00 


C 

0 

6 

13 

0.28 

0.42 

10 

A 

0 

14 

30 

0.87 

0.11 

B 

1 

25 

54 

1.60 

0.19 


Missing 

** 

1 

2 

-1.41 

-1.00 



Table 4 


Distractor Analysis, Grade 4, Form 9 (Continued) 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. Mean 


A 

0 

1 

2 

0.24 

0.00 

11 

B 

0 

7 

15 

0.78 

0.37 

C 

1 

37 

80 

1.30 

0.16 


Missing 

** 

1 

2 

-1.41 

0.00 


C 

0 

5 

11 

0.53 

0.31 

12 

A 

0 

18 

39 

0.87 

0.22 

B 

1 

22 

48 

1.61 

0.19 


Missing 

** 

1 

2 

-1.41 

0.00 


B 

0 

2 

4 

-0.02 

0.53 

13 

C 

0 

2 

4 

-0.16 

0.95 

A 

1 

41 

89 

1.32 

0.14 


Missing 

** 

1 

2 

-1.41 

0.00 


A 

0 

1 

2 

0.24 

0.00 

14 

C 

0 

1 

2 

-1.11 

0.00 

B 

1 

42 

91 

1.31 

0.13 


Missing 

** 

2 

4 

-0.98 

0.43 


B 

0 

1 

2 

1.76 

0.00 

15 

C 

1 

19 

41 

1.28 

0.27 

A 

0 

25 

54 

1.1 

0.16 


Missing 

** 

1 

2 

-1.41 

0.00 


C 

0 

1 

2 

-1.11 

0.00 

16 

B 

0 

2 

4 

-0.29 

0.26 

A 

1 

42 

91 

1.32 

0.13 


Missing 

** 

1 

2 

-1.41 

0.00 


C 

0 

5 

11 

0.09 

0.27 

17 

A 

0 

10 

22 

0.92 

0.32 

B 

1 

30 

65 

1.47 

0.15 


Missing 

** 

1 

2 

-1.41 

0.00 


B 

0 

6 

13 

0.16 

0.27 

18 

C 

1 

16 

35 

1.77 

0.27 

A 

0 

23 

50 

1.06 

0.13 


Missing 

** 

1 

2 

-1.41 

0.00 


B 

0 

2 

4 

0.94 

0.15 

19 

A 

0 

11 

24 

0.84 

0.28 

C 

1 

32 

70 

1.33 

0.17 


Missing 

** 

1 

2 

-1.41 

0.00 


C 

0 

8 

17 

0.29 

0.28 

20 

B 

0 

17 

37 

1.19 

0.22 

A 

1 

20 

43 

1.56 

0.19 


Missing 

** 

1 

2 

-1.41 

0.00 
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Table 5 

Item Statistics, Entry Order, Grade 4, Form 10 


Item Number 

Raw 

Score 

Count 

Measure 

Model Standard 
Error 

Mean Square 
Outfit 

1 

45 

49 

-1.70 

-0.41 

0.64 

2 

45 

49 

-1.70 

-0.46 

0.61 

3 

48 

49 

-3.31 

-0.73 

0.08 

4 

35 

49 

0.06 

1.45 

1.36 

5 

28 

49 

0.80 

0.51 

1.08 

6 

23 

49 

1.28 

0.76 

1.12 

7 

38 

49 

-0.32 

-0.47 

0.84 

8 

30 

49 

0.60 

0.66 

1.11 

9 

24 

49 

1.19 

-0.96 

0.84 

10 

24 

49 

1.19 

-0.60 

0.90 

11 

36 

49 

-0.06 

2.30 

1.67 

12 

18 

49 

1.77 

1.13 

1.25 

13 

33 

49 

0.29 

-0.65 

0.86 

14 

40 

49 

-0.62 

-0.78 

0.71 

15 

38 

49 

-0.32 

-1.08 

0.69 

16 

37 

49 

-0.19 

1.00 

1.27 

17 

25 

49 

1.09 

-0.54 

0.91 

18 

40 

49 

-0.62 

-0.26 

0.87 

19 

34 

49 

0.18 

-0.36 

0.91 

20 

32 

49 

0.40 

-0.86 

0.83 



Table 6 


Distractor Analysis, Grade 4, Form 10 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. Mean 


B 

0 

1 

2 

-2.16 

0.00 

1 

A 

0 

3 

6 

0.49 

0.08 

C 

1 

45 

92 

1.22 

0.13 


Missing 

** 






B 

0 

0 

0 

0.00 

0.00 


C 

0 

3 

6 

-0.67 

0.80 

L 

A 

1 

45 

92 

1.23 

0.12 


Missing 

** 

1 

2 

0.83 

0.00 


C 

0 

0 

0 

0.00 

0.00 

\ 

A 

0 

1 

2 

-2.16 

0.00 

J 

B 

1 

48 

98 

1.17 

0.12 


Missing 

** 






B 

0 

1 

2 

0.07 

0.00 

A 

A 

0 

11 

22 

1.02 

0.24 


C 

1 

35 

71 

1.20 

0.17 


Missing 

** 

2 

4 

0.33 

0.50 


A 

0 

9 

18 

0.86 

0.23 

c 

C 

0 

10 

20 

0.76 

0.37 


B 

1 

28 

57 

1.36 

0.17 


Missing 

** 

2 

4 

0.33 

0.50 


C 

0 

8 

16 

0.79 

0.30 

c. 

A 

0 

16 

33 

0.98 

0.29 

0 

B 

1 

23 

47 

1.36 

0.16 


Missing 

** 

2 

4 

0.33 

0.50 


B 

0 

1 

2 

1.39 

0.00 

n 

C 

0 

8 

16 

0.22 

0.40 

/ 

A 

1 

38 

78 

1.32 

0.13 


Missing 

** 

2 

4 

0.33 

0.50 


B 

0 

1 

2 

1.72 

0.00 

8 

A 

0 

15 

31 

0.78 

0.29 

C 

1 

30 

61 

1.29 

0.15 


Missing 

** 

3 

6 

0.68 

0.46 


C 

0 

5 

10 

-0.02 

0.68 

o 

B 

0 

18 

37 

0.87 

0.15 

y 

A 

1 

24 

49 

1.57 

0.15 


Missing 

** 

2 

4 

0.33 

0.50 


C 

0 

6 

12 

0.13 

0.50 

10 

A 

0 

16 

33 

1.09 

0.21 

B 

1 

24 

49 

1.53 

0.12 


Missing 

** 

3 

6 

-0.29 

0.39 



Table 6 


Distractor Analysis, Grade 4, Form 10 (Continued) 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. Mean 


B 

0 

1 

2 

0.07 

0.00 

11 

A 

0 

10 

20 

1.38 

0.29 

C 

1 

36 

73 

1.15 

0.15 


Missing 

** 

2 

4 

-0.59 

0.41 


B 

0 

10 

20 

1.28 

0.24 

12 

C 

0 

17 

35 

1.01 

0.27 

A 

1 

18 

37 

1.29 

0.19 


Missing 

** 

4 

8 

0.20 

0.51 


C 

0 

5 

10 

0.47 

0.69 

13 

A 

0 

9 

18 

0.69 

0.26 

B 

1 

33 

67 

1.41 

0.12 


Missing 

** 

2 

4 

-0.59 

0.41 


C 

0 

2 

4 

-0.79 

1.36 

14 

B 

0 

5 

10 

0.60 

0.35 

A 

1 

40 

82 

1.34 

0.11 


Missing 

** 

2 

4 

-0.59 

0.41 


A 

0 

2 

4 

-1.30 

0.86 

15 

B 

0 

6 

12 

0.80 

0.19 

C 

1 

38 

78 

1.39 

0.12 


Missing 

** 

3 

6 

-0.28 

0.36 


C 

0 

4 

8 

1.39 

0.38 

16 

B 

0 

5 

10 

0.87 

0.38 

A 

1 

37 

76 

1.22 

0.15 


Missing 

** 

3 

6 

-0.28 

0.36 


C 

0 

10 

20 

0.64 

0.39 

17 

A 

0 

11 

22 

0.98 

0.23 

B 

1 

25 

51 

1.51 

0.13 


Missing 

** 

3 

6 

-0.28 

0.36 


C 

0 

2 

4 

-0.92 

1.24 

18 

B 

0 

3 

6 

1.00 

0.72 

A 

1 

40 

82 

1.31 

0.12 


Missing 

** 

4 

8 

0.06 

0.43 


A 

0 

4 

8 

0.15 

1.00 

19 

B 

0 

8 

16 

0.73 

0.24 

C 

1 

34 

69 

1.42 

0.11 


Missing 

** 

3 

6 

-0.28 

0.36 


A 

0 

3 

6 

0.67 

0.23 

20 

C 

0 

10 

20 

0.55 

0.40 

B 

1 

32 

65 

1.45 

0.12 


Missing 

** 

4 

8 

0.06 

0.43 



p. 41 


Table 7 

Item Statistics, Entry Order, Grade 4, Form 14 


Item Number 

Raw 

Score 

Count 

Measure 

Model Standard 
Error 

Mean Square 
Outfit 

1 

48 

51 

-2.13 

-0.36 

0.57 

2 

49 

51 

-2.59 

0.65 

1.35 

3 

46 

51 

-1.53 

-0.59 

0.61 

4 

23 

51 

1.23 

0.68 

1.10 

5 

45 

51 

-1.31 

-0.71 

0.61 

6 

29 

51 

0.68 

3.02 

1.55 

7 

34 

51 

0.20 

0.14 

1.01 

8 

39 

51 

-0.37 

-0.97 

0.72 

9 

19 

51 

1.61 

1.41 

1.27 

10 

33 

51 

0.30 

0.20 

1.03 

11 

23 

51 

1.23 

-1.49 

0.78 

12 

18 

51 

1.71 

0.36 

1.06 

13 

19 

51 

1.61 

1.48 

1.28 

14 

46 

51 

-1.53 

-0.82 

0.51 

15 

33 

51 

0.30 

-1.02 

0.80 

16 

46 

51 

-1.53 

-0.47 

0.65 

17 

26 

51 

0.96 

0.33 

1.04 

18 

16 

51 

1.91 

1.11 

1.25 

19 

43 

51 

-0.94 

-0.62 

0.71 

20 

34 

51 

0.20 

-0.24 

0.94 



Table 8 


Distractor Analysis, Grade 4, Form 14 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. Mean 


A 

0 

1 

2 

-0.84 

0.00 

1 

C 

0 

2 

4 

-0.30 

1.21 

B 

1 

48 

94 

1.09 

0.12 


Missing 

** 






A 

0 

0 

0 

0.00 

0.00 


B 

0 

2 

4 

0.34 

1.18 

L 

C 

1 

49 

96 

1.02 

0.13 


Missing 

** 






A 

0 

1 

2 

-0.24 

0.00 

\ 

B 

0 

4 

8 

-0.13 

0.52 

J 

C 

1 

46 

90 

1.12 

0.13 


Missing 

** 






B 

0 

5 

10 

-0.04 

0.50 

A 

A 

1 

23 

45 

1.30 

0.20 


C 

0 

23 

45 

0.92 

0.14 


Missing 

** 






C 

0 

1 

2 

-0.24 

0.00 


A 

0 

5 

10 

-0.03 

0.39 


B 

1 

45 

88 

1.14 

0.13 


Missing 

** 






B 

0 

4 

8 

0.04 

0.76 

c. 

C 

0 

18 

35 

1.12 

0.21 

0 

A 

1 

29 

57 

1.06 

0.15 


Missing 

** 






A 

0 

2 

4 

0.33 

0.57 

n 

B 

0 

13 

25 

0.74 

0.14 

/ 

C 

1 

34 

67 

1.19 

0.17 


Missing 

** 

2 

4 

0.03 

0.87 


B 

0 

2 

4 

-0.09 

0.14 

8 

C 

0 

10 

20 

0.21 

0.27 

A 

1 

39 

76 

1.26 

0.13 


Missing 

** 






A 

0 

12 

24 

0.45 

0.23 

o 

B 

1 

19 

37 

1.12 

0.19 

y 

C 

0 

19 

37 

1.27 

0.23 


Missing 

** 

1 

2 

0.05 

0.00 


B 

0 

4 

8 

0.62 

0.24 

10 

A 

0 

14 

27 

0.57 

0.24 

C 

1 

33 

65 

1.22 

0.16 


Missing 

** 







Table 8 


Distractor Analysis, Grade 4, Form 14 (Continued) 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. Mean 


B 

0 

8 

16 

0.39 

0.31 

11 

C 

0 

19 

37 

0.67 

0.15 

A 

1 

23 

45 

1.56 

0.17 


Missing 

** 

1 

2 

-0.84 

0.00 


A 

0 

14 

27 

0.75 

0.23 

12 

B 

0 

18 

35 

0.93 

0.20 

C 

1 

18 

35 

1.36 

0.22 


Missing 

** 

1 

2 

-0.84 

0.00 


C 

0 

7 

14 

0.38 

0.23 

13 

B 

1 

19 

37 

1.40 

0.26 

A 

0 

23 

45 

0.95 

0.14 


Missing 

** 

2 

4 

-0.11 

0.73 


C 

0 

1 

2 

0.33 

0.00 

14 

B 

0 

3 

6 

-0.38 

0.62 

A 

1 

46 

90 

1.14 

0.12 


Missing 

** 

1 

2 

-0.84 

0.00 


A 

0 

2 

4 

-0.73 

0.78 

15 

C 

0 

15 

29 

0.61 

0.17 

B 

1 

33 

65 

1.34 

0.14 


Missing 

** 

1 

2 

-0.84 

0.00 


C 

0 

1 

2 

-1.51 

0.00 

16 

A 

0 

3 

6 

0.62 

0.00 

B 

1 

46 

90 

1.12 

0.13 


Missing 

** 

1 

2 

-0.84 

0.00 


B 

0 

1 

2 

1.21 

0.00 

17 

A 

0 

22 

43 

0.65 

0.14 

C 

1 

26 

51 

1.37 

0.19 


Missing 

** 

2 

4 

-0.11 

0.73 


C 

0 

2 

4 

-1.17 

0.33 

18 

A 

1 

16 

31 

1.20 

0.23 

B 

0 

31 

61 

1.11 

0.13 


Missing 

** 

2 

4 

-0.11 

0.73 


A 

0 

1 

2 

0.91 

0.00 

19 

B 

0 

4 

8 

0.11 

0.40 

C 

1 

43 

84 

1.16 

0.13 


Missing 

** 

3 

6 

-0.15 

0.42 


C 

0 

6 

12 

0.33 

0.28 

20 

B 

0 

9 

18 

0.61 

0.39 

A 

1 

34 

67 

1.28 

0.13 


Missing 

** 

2 

4 

-0.11 

0.73 
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Table 9 


Item Statistics, Entry Order, Grade 4, Form 15 


Item Number 

Raw 

Score 

Count 

Measure 

Model Standard 
Error 

Mean Square 
Outfit 

1 

44 

49 

-1.48 

1.44 

2.00 

2 

44 

49 

-1.48 

-0.71 

0.53 

3 

44 

49 

-1.48 

-0.22 

0.75 

4 

41 

49 

-0.89 

-0.61 

0.69 

5 

28 

49 

0.7 

0.08 

1.00 

6 

39 

49 

-0.58 

-0.82 

0.69 

7 

17 

49 

1.82 

0.76 

1.15 

8 

29 

49 

0.6 

0.35 

1.05 

9 

30 

49 

0.49 

-0.83 

0.84 

10 

19 

49 

1.61 

1.71 

1.35 

11 

15 

49 

2.05 

0.22 

1.03 

12 

38 

49 

-0.44 

-0.17 

0.91 

13 

42 

49 

-1.06 

-0.65 

0.65 

14 

36 

49 

-0.18 

-0.77 

0.77 

15 

41 

49 

-0.89 

-0.69 

0.67 

16 

43 

49 

-1.26 

-0.15 

0.82 

17 

30 

49 

0.49 

0.07 

1.00 

18 

26 

49 

0.90 

0.32 

1.04 

19 

31 

49 

0.39 

0.64 

1.12 

20 

28 

49 

0.70 

-0.22 

0.95 



Table 10 


Distractor Analysis, Grade 4, Form 15 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. Mean 


B 

0 

0 

0 

0.00 

0.00 

1 

C 

0 

4 

8 

1.18 

0.52 

A 

1 

44 

90 

1.09 

0.15 


Missing 

** 

1 

2 

-0.80 

0.00 


C 

0 

1 

2 

-0.02 

0.00 


A 

0 

3 

6 

0.07 

0.17 

L 

B 

1 

44 

90 

1.19 

0.15 


Missing 

** 

1 

2 

-0.80 

0.00 


B 

0 

1 

2 

-0.27 

0.00 

\ 

A 

0 

3 

6 

0.60 

0.32 

J 

C 

1 

44 

90 

1.16 

0.15 


Missing 

** 

1 

2 

-0.80 

0.00 


A 

0 

4 

8 

-0.28 

0.21 

A 

C 

0 

4 

8 

0.46 

0.38 


B 

1 

41 

84 

1.25 

0.15 


Missing 

** 

0 

0 

0.00 

0.00 


A 

0 

6 

12 

0.36 

0.33 

c 

B 

0 

15 

31 

0.66 

0.28 


C 

1 

28 

57 

1.42 

0.17 


Missing 

** 






B 

0 

1 

2 

-1.08 

0.00 

c. 

C 

0 

9 

18 

0.25 

0.19 

0 

A 

1 

39 

80 

1.30 

0.16 


Missing 

** 






C 

0 

5 

10 

-0.12 

0.35 

n 

B 

1 

17 

35 

1.50 

0.24 

/ 

A 

0 

26 

53 

1.04 

0.18 


Missing 

** 

1 

2 

-0.27 

0.00 


A 

0 

7 

14 

0.36 

0.23 

8 

B 

0 

12 

24 

0.56 

0.28 

C 

1 

29 

59 

1.38 

0.18 


Missing 

** 

1 

2 

2.65 

0.00 


A 

0 

6 

12 

-0.10 

0.29 

O 

C 

0 

13 

27 

0.62 

0.19 

7 

B 

1 

30 

61 

1.48 

0.18 


Missing 

** 






B 

0 

11 

22 

0.75 

0.25 

10 

A 

1 

19 

39 

1.33 

0.24 

C 

0 

19 

39 

0.96 

0.25 


Missing 

** 







Table 10 


Distractor Analysis, Grade 4, Form 15 (Continued) 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. Mean 


B 

0 

12 

24 

0.50 

0.29 

11 

A 

1 

15 

31 

1.68 

0.27 

C 

0 

19 

39 

0.93 

0.21 


Missing 

** 

3 

6 

0.97 

0.20 


C 

0 

2 

4 

0.94 

0.43 

12 

A 

0 

8 

16 

0.32 

0.26 

B 

1 

38 

78 

1.23 

0.17 


Missing 

** 

1 

2 

0.77 

0.00 


C 

0 

2 

4 

-0.14 

0.13 

13 

A 

0 

5 

10 

0.14 

0.31 

B 

Missing 

1 

** 

42 

86 

1.22 

0.15 


A 

0 

6 

12 

0.10 

0.40 

14 

B 

0 

7 

14 

0.33 

0.20 

C 

Missing 

1 

** 

36 

73 

1.36 

0.16 


B 

0 

3 

6 

0.07 

0.58 

15 

C 

0 

4 

8 

-0.01 

0.39 

A 

1 

41 

84 

1.25 

0.15 


Missing 

** 

1 

2 

0.24 

0.00 


B 

0 

2 

4 

0.39 

0.67 

16 

A 

0 

3 

6 

0.25 

0.67 

C 

1 

43 

88 

1.18 

0.15 


Missing 

** 

1 

2 

-0.27 

0.00 


C 

0 

7 

14 

0.53 

0.25 

17 

B 

0 

11 

22 

0.67 

0.27 

A 

1 

30 

61 

1.36 

0.19 


Missing 

** 

1 

2 

-0.27 

0.00 


C 

0 

6 

12 

-0.05 

0.33 

18 

B 

0 

16 

33 

0.98 

0.23 

A 

1 

26 

53 

1.41 

0.19 


Missing 

** 

1 

2 

-0.27 

0.00 


A 

0 

2 

4 

-0.68 

0.40 

19 

C 

0 

15 

31 

0.85 

0.25 

B 

1 

31 

63 

1.31 

0.17 


Missing 

** 

1 

2 

-0.27 

0.00 


A 

0 

8 

16 

0.4 

0.28 

20 

B 

0 

12 

24 

0.68 

0.24 

C 

1 

28 

57 

1.45 

0.19 


Missing 

** 

1 

2 

-0.27 

0.00 
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Table 1 1 


Item Statistics, Entry Order, Grade 4, Form 16 


Item Number 

Raw 

Score 

Count 

Measure 

Model Standard 
Error 

Mean Square 
Outfit 

1 

45 

52 

-1.40 

-0.83 

0.59 

2 

48 

52 

-2.17 

0.71 

1.38 

3 

50 

52 

-3.06 

-0.32 

0.37 

4 

36 

52 

-0.09 

1.01 

1.23 

5 

45 

52 

-1.40 

0.26 

1.05 

6 

36 

52 

-0.09 

0.00 

0.98 

7 

25 

52 

0.99 

0.23 

1.03 

8 

40 

52 

-0.58 

-0.60 

0.80 

9 

36 

52 

-0.09 

-0.49 

0.87 

10 

13 

52 

2.18 

1.49 

1.63 

11 

30 

52 

0.52 

1.23 

1.24 

12 

30 

52 

0.52 

-0.94 

0.82 

13 

30 

52 

0.52 

0.09 

1.01 

14 

44 

52 

-1.20 

-1.04 

0.57 

15 

12 

52 

2.30 

-0.10 

0.90 

16 

41 

52 

-0.72 

-1.71 

0.51 

17 

30 

52 

0.52 

0.22 

1.03 

18 

37 

52 

-0.21 

0.35 

1.07 

19 

26 

52 

0.90 

3.67 

1.90 

20 

10 

52 

2.56 

0.07 

0.96 



Table 12 


Distractor Analysis, Grade 4, Form 16 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. Mean 


C 

0 

1 

2 

0.28 

0.00 

1 

A 

0 

4 

8 

-0.28 

0.34 

B 

1 

45 

87 

1.05 

0.14 


Missing 

** 

2 

4 

-1.58 

1.31 


C 

0 

0 

0 

0.00 

0.00 


B 

0 

2 

4 

0.52 

1.35 

L 

A 

1 

48 

92 

0.95 

0.14 


Missing 

** 

2 

4 

-1.58 

1.31 


A 

0 

0 

0 

0.00 

0.00 

\ 

B 

0 

0 

0 

0.00 

0.00 

J 

C 

1 

50 

96 

0.93 

0.14 


Missing 

** 

2 

4 

-1.58 

1.31 


C 

0 

5 

10 

0.49 

0.37 

A 

A 

0 

8 

15 

1.06 

0.27 


B 

1 

36 

69 

1.05 

0.15 


Missing 

** 

3 

6 

-1.82 

0.79 


A 

0 

1 

2 

1.16 

0.00 

c 

C 

0 

3 

6 

0.95 

0.10 


B 

1 

45 

87 

1.00 

0.13 


Missing 

** 

3 

6 

-1.82 

0.79 


B 

0 

4 

8 

1.22 

0.41 

/Z 

A 

0 

9 

17 

0.19 

0.31 

O 

C 

1 

36 

69 

1.17 

0.12 


Missing 

** 

3 

6 

-1.82 

0.79 


B 

0 

3 

6 

0.82 

0.62 

n 

C 

0 

22 

42 

0.70 

0.18 

/ 

A 

1 

25 

48 

1.22 

0.17 


Missing 

** 

2 

4 

-2.60 

0.29 


A 

0 

2 

4 

0.75 

0.74 

8 

B 

0 

8 

15 

0.21 

0.25 

C 

1 

40 

77 

1.13 

0.13 


Missing 

** 

2 

4 

-2.60 

0.29 


A 

0 

5 

10 

0.51 

0.35 

O 

C 

0 

9 

17 

0.35 

0.30 

7 

B 

1 

36 

69 

1.19 

0.13 


Missing 

** 

2 

4 

-2.60 

0.29 


A 

1 

13 

25 

1.18 

0.33 

10 

C 

0 

15 

29 

0.79 

0.22 

B 

0 

22 

42 

0.97 

0.13 


Missing 

** 

2 

4 

-2.60 

0.29 



Table 12 


Distractor Analysis, Grade 4, Form 16 (Continued) 


Entry # 

Data Code 

Score Value 

Count 

% 

Average 

Measure 

S.E. Mean 


B 

0 

0 

0 

0.00 

0.00 

11 

C 

0 

20 

38 

0.84 

0.23 

A 

1 

30 

58 

1.06 

0.14 


Missing 

** 

2 

4 

-2.60 

0.29 


C 

0 

6 

12 

0.38 

0.28 

12 

A 

0 

13 

25 

0.58 

0.24 

B 

1 

30 

58 

1.32 

0.13 


Missing 

** 

3 

6 

-2.01 

0.61 


B 

0 

7 

13 

0.97 

0.25 

13 

A 

0 

12 

23 

0.56 

0.27 

C 

1 

30 

58 

1.19 

0.14 


Missing 

** 

3 

6 

-2.01 

0.61 


C 

0 

2 

4 

-0.28 

0.56 

14 

A 

0 

3 

6 

0.38 

0.25 

B 

1 

44 

85 

1.11 

0.12 


Missing 

** 

3 

6 

-2.01 

0.61 


C 

1 

12 

23 

1.58 

0.23 

15 

B 

0 

16 

31 

1.04 

0.20 

A 

0 

20 

38 

0.73 

0.14 


Missing 

** 

4 

8 

-1.72 

0.52 


A 

0 

1 

2 

0.28 

0.00 

16 

B 

0 

7 

13 

-0.11 

0.23 

C 

1 

41 

79 

1.22 

0.11 


Missing 

** 

3 

6 

-2.01 

0.61 


C 

0 

2 

4 

0.14 

0.14 

17 

B 

0 

17 

33 

0.76 

0.19 

A 

1 

30 

58 

1.20 

0.15 


Missing 

** 

3 

6 

-2.01 

0.61 


C 

0 

6 

12 

0.72 

0.41 

18 

A 

0 

7 

13 

-0.19 

0.12 

B 

1 

37 

71 

1.18 

0.15 


Missing 

** 

2 

4 

-1.57 

0.74 


A 

0 

8 

15 

0.62 

0.28 

19 

B 

0 

16 

31 

0.91 

0.21 

C 

1 

26 

50 

1.04 

0.22 


Missing 

** 

2 

4 

-1.57 

0.74 


A 

1 

10 

19 

1.40 

0.20 

20 

B 

0 

16 

31 

0.54 

0.31 

C 

0 

24 

46 

0.99 

0.18 


Missing 

** 

2 

4 

-1.57 

0.74 



